# Grouped query attention
Mistral Nemo Base 2407 Chatml
Apache-2.0
Mistral-Nemo-Base-2407 is a 12-billion-parameter pretrained generative text model trained jointly by Mistral AI and NVIDIA. It is reported to outperform models of similar or smaller size.
Large Language Model
Transformers, Supports Multiple Languages

IntervitensInc
191
3
Llama 3.1 70B
Meta Llama 3.1 is a series of large language models supporting 8 languages, available in 8B, 70B, and 405B sizes, and reported to outperform many open-source and proprietary chat models on industry benchmarks.
Large Language Model
Transformers, Supports Multiple Languages

meta-llama
97.35k
358
Mistral 7B Instruct V0.1 Sharded
Apache-2.0
Mistral-7B-Instruct-v0.1 is an instruction fine-tuned version of Mistral-7B-v0.1, suited to dialogue generation tasks.
Large Language Model
Transformers

filipealmeida
1,363
14